Natural Language I

نویسنده

  • Bonnie Lynn Webber
چکیده

Except for the final presentation by Hovy, this session focussed on the use of superficial features of Natural Language in text processing (messages, in the case of the first two presentations, unrestricted text in the case of the second two). This is a very brief summary of a moderator 's view of the action. The first presentation was given by Ralph Grishman (NYU) describing work done by himself and John Sterling in preparing for the second ARPA workshop on message understanding (MUCK-2). Up to that point, their approach to message understanding had been to build a very rich semantics for a domain, which then could be accessed by their message processor in understanding ellipses, noun-noun complementation and other implicit relations, etc. Its very rich semantics gave their system a great deal of power. However, the restricted time participants were given to adapt their system to a new domain for MUCK-2 did not allow Grishman & Sterling to construct a uniform rich semantics for the whole domain. In his presentation, Grishman described their use of Wilk's Preference Semantics, which allowed them, in a short time, to capture some of the semantics of all the domain, rather than all the semantics of some of the domain, and thereby achieve a greater overall success in the MUCK-2 challenge. In the second presentation, Jerry Hobbs (SRI International) compared a variety of parse preference strategies that made use of syntactic criteria alone rather than at tempting to draw upon semantic and pragmatic criteria as well. For each of the preference strategies, Hobbs presented examples that would fail under that strategy. As one might expect, there was no one purely syntactic strategy that was found to improve results all around. The next two presentations discussed language statistics and their application to processing unrestricted text. The first of these was given by Ken Church (AT&T Bell Labs) who, in his alloted 15 minutes, presented the results of two separate pieces of work. He first described experiments carried out by himself and his colleagues (Gale, Hanks and Hindle) over several millions of words of text, to characterize co-occurrence relations among words in English texts. He then described other work by himself and Gale, in which they investigated two different methods of estimating the frequency of given bi-grams in a test corpus, given a training corpus one based on a method due to Good and Turing, the other, a purely empirical method they call Categorize-Calibrate or Cat-Cal. Their results led them to advocate the latter in the case of small counts, the former in all other cases. Julian Kupiec (Xerox PARC) then described a stochastic method for assigning part-of-speech categories to unrestricted text, in a way that eliminates the need for a pre-tagged training corpus and allows some word dependency across phrases. In the final presentation of the session, Ed Hovy (USC-ISI) advocated a new US effort in machine translation (MT) that would meld current transfer and inter-lingua approaches into a single approach that would take advantage of recent advances in grammatical theory. He characterized a twophase effort that would begin with a single modest MT project, and then move to a few small 3-5 person efforts working on limited application domains. Hovy emphasized that the need for MT (including machine-aided human translation, and human-aided machine translation) has not only not gone away, but can only increase, given the changes in Europe brought about by the European Community 's plans for 1992 and beyond, and the increasing economic inter-dependency of the Pacific Rim countries. He ended by asking that people interested in getting this new MT effort started contact him.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Goal Oriented Parsing: Improving The Efficiency Of Natural Language Access To Relational Data Bases

This paper is devoted to present a new approach to natural language understanding which is called here goal-oriented parsing. The interaction in natural language with a r t i f i c i a l systems (robots, data base systems, program generators, question-answering systems, etc.) does not require in most cases of actual interest a fu l l (human l ike) comprehension of natural language in all i ts d...

متن کامل

TEAM: A Transportable Natural-Language Interface System

A majo r b e n e f i t of u s i n g n a t u r a l l anguage to a c c e s s the i n f o r m a t i o n in a d a t a b a s e i s t h a t i t shifts onto the system the burden of m e d i a t i n g between two views of the d a t a : the way in which the da ta i s s t o r e d ( t h e " d a t a b a s e v i e w " ) , and the way in which an e n d u s e r t h i n k s abou t i t ( t h e " u s e r * s v i...

متن کامل

A Phrase-Structured Grammatical Framework for Transportable Natural Language Processing

The F I N I T E S T R I N G News le t te r Calls for Papers 151 Computer Assisted Information Retrieval, Theoretical Approaches to Natural Language Understanding, 23rd Annual Meeting of the ACL, IJCAI-85, Theoretical and Methodological Issues in Machine Translation of Natural Languages, International Pragmatics Programs 155 Coling84, International Workshop: Natural Language Understanding and Lo...

متن کامل

Word-Forming Process in Azeri Turkish Language

The subject intended to study the general methods of natural word-forming in Azeri Turkish language. This study aimed to reach this purpose by analyzing the construction of compound Azeri Turkish words. Same’ei (2016) did a comprehensive study on word-forming process in Farsi, which was the inspiration source of this study for Azeri Turkish language word-forming. Numerous scholars had done vari...

متن کامل

Model Generation without Normal Forms andApplications in Natural - Language

I present a new tableaux-based model generation method for rst-order formulas without function symbols. Unlike comparable approaches , the Relational Models (RM) tableaux calculus does not require clausal input theories. I propose some applications of the RM calculus in natural-language semantics and discuss its usefulness as an inference procedure in natural-language processing.

متن کامل

Natural Language Understanding with Distributed Representation

This is a lecture note for the course DS-GA 3001 〈Natural Language Understanding with Distributed Representation 〉 at the Center for Data Science1, New York University in Fall, 2015. As the name of the course suggests, this lecture note introduces readers to a neural network based approach to natural language understanding/processing. In order to make it as self-contained as possible, I spend m...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1989